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I. Introduction 


In this note we consider one aspect (arguments for convergence and 
stability via a variational approach) of least-squares formulations of parameter 
estimation problems for partial differential equations. Conceptually, one has a 
dynamical model with "states" u = u(t,x), 0«t$T, xen, and parameters 
q = q(t,x) in some admissible set Q, Given observations or data, z e Z, 
of some type (e.g., z = {Ujj} as observations for (u(1^,Xj)}), one wishes to 
determine parameters q that give a best fit of the model to the data. That 
is, one has the constrained optimization problem: From an admissible 

parameter set Q, choose a parameter q so that the corresponding solution of 
the dynamical model gives the best fit to data using a least-squares fit 
criterion. 

Abstractly, we have a state space H in which we solve a dynamical 
system (S) for parameter dependent solutions u = u(q) with the parameters 
chosen from some infinite dimensional set Q. If C: H -* Z is a mapping 
from the state space to the observation space Z, the problem is one of 
minimizing 


2 

(1.1) J(q,z) = |Cu(q) - z | 

over q € Q, where \-\z is an appropriately chosen norm in Z. 

The fact that many problems of interest are infinite dimensional in both 
state spaces H and parameter sets Q leads to a rich class of mathematical 
questions including well-posedness, stability, and computational approaches. For 
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example, consider the possibility of approximating the state space H by a 
sequence of finite dimensional spaces and approximating the parameter 

set Q by a sequence of finite dimensional sets so as to obtain 

approximating problems: Minimize 


( 1 . 2 ) 


J^(q,z) = ICu^(q) - z| 

Z 


over q e Q^, where u^ is an approximate solution to (S) lying in An 

important question concerns the ways in which approximating Q and 

approximating H might guarantee convergence of solutions of 

the problems of minimizing over to a solution q of the problem of 

minimizing J over Q. A number of results [8], [27] in this area are available 
and we just sketch one set of arguments here (for examples and more details, 
see [8]) 

Suppose that the sets Q and lie in some metric space Q and that, 

in fact, there is a mapping i^: Q -* so that = i^(Q). Further, 

assume that the following hypotheses are satisfied by and 

(i) For any q'' -* q in Q we have Cu'^(q’') -* Cu(q) in Z as N,k -* “>; 

(ii) For each N, the mapping q -» J^(q,z) is continuous in the Q 

topology; 

(iii) The sets Q and Q^, for each M, are compaet in the Q topology; 

(iv) For each q € Q, i^(q) -• q in Q with the convergence uniform in 
q 6 Q. 
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Under these assumptions, let be solutions for the problems for (1.2) and 

let e Q be such that i^(q^’^) = q^’^. From the compactness of Q, 

we may select subsequences, again denoted by (q^’^} and {q^’^} so that 
-► q e Q and q^’^ -♦ q (the latter follows from (iv)). The optimality of 
qN,M guarantees that for every q e Q 

(1.3) jN(qN-'^,z) ^ jN(iM(q),z). 

Using (i) and (iv) and taking the limit as N,M -» ® in this inequality yields 
J(q,z) $ J(q,z) for every q e Q, or that q is a solution of the problem for 
(1.1). (Under uniqueness assumptions on the problems, one can actually 

guarantee convergence of the entire sequence in place of subsequential 

convergence to solutions.) 

We note that the essential aspects in the arguments sketched here involve 
compactness assumptions on the sets and Q. Such compactness ideas play 

a fundamental role in other theoretical and computational aspects of these 
problems. For example, one can formulate distinct concepts of problem 
stability and method stability involving some type of continuous dependence of 
solutions on the observations z in Z, and use hypotheses similar to (i) - (iv), 
with compactness again playing a critical role, to guarantee stability. We 
illustrate with a simple form of method stability (other stronger forms are also 
amenable to this approach). 

We might say that an approximation method, such as that formulated 
above involving Q^, and (1.2), is stable if 
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dist (q^>^(z''),q( 2 °)) -» 0 

as N,M,k -♦ 0 for any ^ z° in Z, where q(z) denotes the set of all 

solutions of the problem for (1.1) and q^’^(z) denotes the set of all solutions 
of the problem for (1.2). Here "dist" represents the usual distance set function. 
Under hypotheses (i) - (iv) one can use arguments very similar to those 

sketched above to establish that one has this method stability. If the sets 
are not defined through a mapping i^ as supposed above, one can still 
obtain this method stability if one replaces (iv) by the assumptions: 

(V) If {q^} is any sequence with q^ e Q^, then there exists q* in Q 
and subsequence {q^^ with q^ -» q* in the Q topology; 

(vi) For any q e Q, there exists a sequence (cf^) with q^ e 

such that c^-* q in Q . 

Similar ideas may be employed to discuss the question of problem 
stability for the problem of minimizing (1.1) over Q - i.e. the orginal 

problem and again compactness of the admissible parameter set plays a critical 
role. For discussions of other questions related to problem stability, see [19], 
[21] - and specifically Remark 5.1 of [21]. 

Compactness of parameter sets also appears to play an important role in 
computational considerations. For example, in certain problems the formulation 
outlined above (involving = I^(Q)) results in a computational framework 

wherein the and Q all lie in some uniform set possessing compactness 

properties. The compactness criteria can then be reduced to uniform 
constraints on the derivatives of the admissible parameter functions. We have 



-5- 


numerical examples which show that imposition of these constraints is 
necessary (and sufficient) for convergence of the resulting algorithms. (This 
offers a possible explanation for some of the numerical failures of such 
methods reported in the engineering literature -e.g. see [37].). 

Thus we have that compactness of admissible parameter sets play a 
fundamental role in a number of aspects - both theoretical and computational- 
in parameter estimation problems. This compactness may be assumed (and 

imposed) explicitly as we have outlined here, or it may be included implicitly 
in the problem formulation through Tychonov regularization as recently 
discussed by Kravaris and Seinfeld [25]. In the regularization approach one 

restricts consideration to a subset Qj of parameters which has compact 

imbedding in Q, modifies the least-squares criterion to include a term which 
insures that minimizing sequences will be bounded and hence compact in 

the original parameter set Q. 

Having made a case for the role that compaetness of admissible 

parameter sets might play in parameter estimation problems, we turn finally to 
the (not unrelated) focus of this note. In particular, we wish to discuss some 
problems in which a variational formulation (as opposed to the semigroup 

approximation framework we have used in many of our previous discussions of 
these problems - see [3,4,5,7,12,13,17]) permits relaxation of the compactness 
criteria needed in convergence, stability and/or eomputational analyses. We 
present several problems for which the variational framework can be used to 
give convergence arguments in the spirit of techniques commonly used in the 
finite-element approach (see [22] and the references therein) to initial-boundary 
value problems for partial differential equations. As we shall see below, the 
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"energy functionals" in our case are parameter dependent and the arguments 
can become somewhat tedious in some instances. 

In the next two sections we discuss problems for which the variational 
approach offers an alternative to the semigroup formulation. However, there 
are some problems for which the semigroup approach is not readily employed 

but for which a variational framework is rather natural. We present two 

such examples in Sections 4 and 5. 

To facilitate our discussions, in some cases we restrict our remarks to 

problems in which we minimize J and of (1.1) and (1.2) over a fixed set 

Q, relegating the role that approximating sets play to comments and 

referring the reader to [8] for an explanation of how one readily extends the 
ideas to problems of minimizing over where approximates Q. 
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II. A "1-D Seismic" Inverse Problem 


We consider the system 

a au 

(2.1) q^(x) ^ = — (qjCx)— ) t > 0, x e n = (0,1) 

ot cx ox 

(2.2) T^(t,0) + q3u(t,0) = f(t) 
ox 

au au 

(2.3) — (t,l) + q,— (t,l) = 0 

at ^ ax 

(2.4) u(0,.) = Uq , Ut(0,.) = Vq 

and the associated inverse problem of estimating q^ qj, qg, q 4 , f, given a 
set of observations {yjj} for (u(tj,Xj)}. 

Such problems are motivated by certain versions of the so-called "1-D 
Seismic Inversion Problem" (see, e.g. [2], [18]). Roughly speaking, one has an 
elastic medium (e.g., the earth) with density q^ and elastic modulus q^. A 
perturbation of the system (explosions, or vibrating loads from specially 
designed trucks) near the surface (x=0) produces a source f for particle 

disturbances u that travel as elastic waves, being partially reflected due to 
the inhomogeneous nature of the medium. An important but difficult problem 
involves using the observed disturbances at the surface or at points along a 
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"bore hole" to determine properties (represented by parameters in the system) of 
the medium. In the highly idealized 1-D "surface seismic" problem, one 

assumes that data are collected at the same point (x=0) where the original 
disturbance or "source" is located. In addition to this hypothesis, other 
unrealistic special assumptions are made about the nature of the traveling and 
reflected waves. Although the standard 1-D formulations are far from reality, 
exploration seismologists have developed techniques for processing actual field 
data (performing a series of experiments and "stacking" the data) so that the 
1-D problems are generally accepted as useful and worthy subjects of 
investigation. Consequently, numerous papers (for some interesting references, 
see the bibliographies of [2], [18]) on the 1-D problems can be found in the 
research literature. 

In many formulations of the seismic inverse problem, the medium is 
assumed to be the half-line x > 0 (with x = 0 the surface) while in others 
(especially some of those dealing with computational schemes) one finds the 
assumption of an artificial finite boundary (say at x = 1) at which no 
downgoing waves are reflected ( an "absorbing" boundary). For the 1-D 

formulation this condition is embodied in a simple boundary condition of the 
form (2.3); here q^ - /qj,(i) /q^li) and one can view this boundary condition 
as resulting from factoring the wave equation (2.1) at x = 1 and imposing 
the condition of "no upgoing waves" at x = 1. 

Equation (2.1) is a 1-D version of the equations for an isotropic elastic 
medium while (2.2) represents an elastic boundary condition at the surface x 
= 0 (qg represents an elastic modulus for the restoring force produced by the 
medium). 
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In the usual seismic experiment, the medium is assumed initially at rest 
so that Uq = Vq = 0. While our analysis below can, with some tedium, be 
extended to treat the case of parameter dependent initial conditions, we shall 
assume that Uq, Vq are given known functions. 

For our discussions here, we shall also assume that the source term f is 
to be estimated in some function space although frequently this term can be 
parameterized in terms of a Euclidean set of parameters, thus simplifying 
somewhat the analysis. We shall also assume that q^(x) = 1 to facilitate 

our arguments (otherwise the analysis is somewhat more tedious and involves 
the use of parameter dependent inner products). 

We reformulate the system (2.1)-(2.4) in variational or weak form, seeking 
a solution t -» u(t) on 0 < t ^ T , with u(t) e H^(n) , satisfying 


q,(l) 


(2.5) <Ujj, t> + <q2Du,Dt> + — Ut(t,l)v|<l) - q2(0) [q3u(t,0) - f(t)]t(0) = 0 


for all e H^(0), along with initial conditions 


( 2 . 6 ) 


U(0) = Ug , Uj(0) = Vg . 


Here and throughout, unless otherwise noted, < , > denotes the usual inner 
product in H° = and D = fx- The parameters q = (qj, qj, q 4 , f) are 
assumed to be in some subset of C(fl) x x x C(0,T) , although as we 
shall point out later, these smoothness requirements can be relaxed. 
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The system (2.5) - (2.6) can, for the purposes of analysis, be abstractly 
formulated using the state space H = x H°(ft) for states u(t) = (u(t,0), 
u(t,l), u(t, •))• To be precise, define 

V = ^ti,5,'l') e H : n = »K0), ^ = >1<1), >]; E H^(ft)j 

and, for v = (n,^,v) in H , the operators M^: H -♦ H, N^: H -* H by 
MqO = ( 0,0,v) = (0, ^ t,0). 

Q4 

We also define the functionals a: V x V -» R\ b(t): V -♦ R^ by 


a(v, 4,) = <q2Dv,D4i> - q2(0)q3v(0)4<0) 
b(t)+ = q2(0)f(t)+(0) . 


Then we can rewrite (2.5) - (2.6) as 


(2.7) <Mq“u’ <NQUj,4>fj + a(u(t),4>) + b(t)4< = 0 

where i(t), 4< e V. We note that in this case the operators Nq, b(t) and 
the functional a each depend on unknown parameters. 
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standard arguments [28, p.273] can be used to guarantee existence of 
solutions u to (2.7), (2.6) satisfying u e C(0,T;V), Uj e C(0,T;H), u^j e 
H°(0,T;V') with (2.7) being satisfied in a weak or distributional sense. 

Furthermore, one can rewrite (2.7) as a first order system and use semigroup 
techniques to argue that, under additional smoothness assumptions on the 
parameters and initial data, one obtains strong solutions that enjoy additional 
smoothness properties. We shall return to these considerations below. 

Turning to approximation and convergence arguments, we shall work with 
our system in the form (2.5), (2.6) although we could equivalently use (2.7) in 
our considerations. We consider Galerkin approximations on finite 
dimensional subspaces H^(fl), N = 1,2,..., and make the standing 

assumption on the orthogonal projections : H°(n) -» H^. 

Assumption A: If o e H°(0), then P^(t> -» <^ in H°. For each (j> e H^(0) , 

we have P^<j> -♦ 0 in 

If the observations for (2.1) - (2.4) for use in the least-squares functional 
are given in pointwise form u(l^,Xj) for the displacement or in an H° sense 
Ux(tj, •) for the strain, it suffices for the convergence and stability arguments 
to argue that u^(t;q^) -* u(t;q) in rf(P) whenever q^ -► q in an 

appropriate sense, where is the Galerkin approximation to the original 

system (2.5), (2.6). 

For parameters q^ = (qj, qg, q^, f^) in an admissible parameter set 
Q , the approximating systems are given by: Find u^(t) e satisfying for 

all 0 e 
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( 2 . 8 ) 


QjO) 

+ <q^ Du^,Dvjj> + 

^4 


uf(t,l)4<l) - qN(0) [qNuN(t,0) - fN(t)]t(0)=0, 


(2.9) 


uN(0) = pNujj 

uN(0) = . 


we make the standing 
Q = C(0) X X X 

{(q 2 - Qai Q 4 > f) € Q 1 qj(x) > v > 0, qg < - n < 0, 0 < 0 ^ q^ « a] 

for some fixed positive eonstants a,P,v,n. 

Suppose then that q^ -* q in Q , where {q^} is any convergent 
sequence in Q, and let u^(q^), u(^ denote the corresponding solutions to 
(2.8), (2.5) respectively. Under Assumption A, we see from the inequality 

|u^(q^) - u(^ ^ « u^(q^) - P^u(q)|^ + P^ u(^ - u(^ 


Regarding the admissible parameter set Q 
assumptions: 

Assumption B: The set Q is compact in the 

H^(0,T) topology and is contained in the set 
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that it suffices to consider z^(t) = u^(t;q^) - P^u(t;^ and argue the 
convergence z^(t) -» 0 in H^(0) for each t in [0,T], 

Defining the "potential energy" functional C) : x -* by 

(2-10) <Kq)(<l>>t) = <q2D(D,Dt> - q2(0)q3<D(0)»i<0) 

and the "boundary damping" functional B : x -• by 

q2(i) 

(2.11) B(q)(d),+) = «t>(l)>l<l) , 

^4 


we may use (2.5) (with q = ^ and (2.8) in 


<Z [J ,vji> = <(u^ - u + u - P^u)jj,t> 


to obtain 

<z^ ,vp> + <D{qN)(2; N + B(q^)(z^ ,t) 


(2.12) 


= <K®(u,vj,) + B(^(u^,v^) - <KqN)(pNu,4,) - B(qN)(pNu^,^,) 
+ <(I - pN)Utj,+> + [q^ (O)f(t) - qN(0)fN(t)]vp{0) 



- 14 - 


for all 
conditions 

(2.13) 
Choosing 

(2.14) 

(2.15) 

we obtain 


(2.16) 


E H^. In addition to this equation, satisfies the initial 

(see (2.6) and (2.9)) 


2^(0) = 0 , 2^(0) = 0. 


q/ = 2 ^ (which is in H^) in (2.12) and defining 


A4<u) = C<^(u, 2 ^) - <I<q^)(P^u, 2 ^) 


8N(t) = Q2(0)f(t) - qN(0)fN(t) , 


from (2.12) the equation 



+ <D(q^)(2^,2^)} + B(q^)(zj^,2j^ ) 



A<Ku^) + ^(t) 2N(t,0) 


+ B(q) (uj, 2 ^ ) - B(q^)(P^Uj, 2 ^j) + <(I-P^)Ujj,z^>. 


If we further define the total energy functional 
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E(qN)(zN) = \zf\^ + (t(qN)(zNzN) 


and the auxilary expressions (for notational convenience) 


(2.17) 


q^d) q^(l) 

5^(0 = ut(t,l) P''ut(t,l) 

Q4 I. 




(2.18) 


bN (t) = qN(0)qN pNu(t,0) - q/0)q3u(t,0) 


(2.19) 


A^(t) = q,Du(t) - q^DP^u(t), 


then we can rewrite (2.16) as 


1 ^ 

dt 


, E(qN)(zN(t)) + B(qN)(zN zN) 


( 2 . 20 ) 


= ±A<Ku) - A<KUj) + {5^(0 zN(t,0) } - 5N(t)zN(t,0) 


+ 5j^(t)z[^(t,l) + <(I - P^)Ujj,zf^>. 




We next observe that from Assumption B we have 


B(qN)(zN,zN) \z^(t,l)\' 
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while the inequality be « + fic^ implies, with a proper choice of 

constants, 


8f(t)zN(t,l) « 


a 
4 V 




Using these inequalities in (2.20) we obtain 


^ (i E(qN)(zN(t)) - a<Ku) - 5N(t)zN(t,0)) 


( 2 . 21 ) 


« - A<D(Uj) + ^ |Si(t)]^ - 63 t(t)z^(t, 0 ) + <(I - P^)Utj,z^>. 

Integrating this expression and using the facts that A<I>(u)|j_q = 0 , z^(0) = 0 , 
and E(q^)(0) = 0 , we thus find 


( 2 . 22 ) 


E(qN)(zN(t)) « 2A<Ku) + 25^ (t)z^(t,0) + 2rG^(t)dt 

•>0 


where is defined as the function on the right side of the inequality 

(2.21). Finally, to make use of this bound on the parameter-dependent energy, 
we employ the following set of inequalities: 
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2AC<u)(t) $ 


+ ^loz^d)! + ^ 1 


25N(t)zN(t,0) < 


%rk(.,o)r . 


-2a<KUj)(0 ^ 

^ i<«)r 

+ ^|dz“({)| + 

*s)i^ piAs.or, 

-25^(Oz''(t,0) 

« Whsi 

(o|% ^|z''a,o)r 


2<(I - pN)u,,, 

z^> < |( 

I - p-kI^ * |zi*|^ 


E(qN)(zN(t)) 3. 

hrwl’ 

+ v|Dz^(t)| + vnjz 



Then (2.22) may be replaced by 


(2.23) 


where 


E(zN(t)) < rN(t) + I 


t _ 

E(z^(0)dt 


0 


(2.24) E(zN( 0) = |zf(0 I' + 7 |dzN( 0|' + p|z^(^,0)| ' 
and 

(2.25) rN(t) = J-|AN(t)|' + — |5N(t)|' + — |5N(t)P + f F(0d4 

V ' ' vn vti Jq 


with 
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(2.26) F(0 lAr(OT+ 4l®^t(0r 

+ |(I - P%tt(o| + ^ls3t(0l • 

Thus, from (2.23) and the Gronwall inequality, to establish that 
E(z^(t)) -* 0 (and hence that z?^(t) -» 0 in H^(n)), it suffices to argue that 
r^(t) -♦ 0 for each t in (0,T], In view of the definitions (2.15), (2.17) - 
(2.19), this convergence is easily argued under the Assumptions A, B (where 
q^ -* q in the Q topology) and that u(t) e H^(n), u^(t) e H^( 0 ), u^^ e 
H°([0,T] X n). To complete the discussions of this section, we shall comment 
on a number aspects of the above-outlined results in the form of several 
remarks. 

Remark 2.1 . Recalling Assumption B, we note that the formulation above 

requires the elastic moduli q^ lie in C(n) and that the compactness 
properties of Q with respect to this component be in the C( 0 ) sense. 

These are readily weakened by formulating the problem with the q^ 
component replaced by ( 02 ( 6 ), q 2 (l)> Q 2 ) x R^ x L“(n) - see equation 

(2.5) - with a corresponding change in the Q topology employed for the 
compactness statement in Assumption B. This is especially useful if one wishes 
to consider discontinuous elastic moduli (an important formulation in 
"multi-layered" seismic problems) and the ease with which such modifications 
are treated in the variational framework above make it ideally suited for 
convergence arguments when estimating discontinuous parameters. 
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Remark 2.2. In cases where the initial functions Uq, Vg of (2.6) also depend 
on unknown parameters, i.e., Ug = Ug(q), Vg = Vg(q) , then the initial conditions 
(2.13) must be replaced by z'^(O) = P'^[Ug(q^) - Ug(^] , z^(0) = P^[Vg(q^) - 

Vg(^]. The convergence arguments can be extended to this case if one makes 
appropriate smoothness assumptions on Ug, Vg as functions of the parameters 

q. 


Remark 2.3. Assumption A, the fundamental approximation hypotheses on 
, is readily shown to hold if one chooses either the standard piecewise 
linear or cubic splines as approximation elements and hence these 

approximation schemes are included as special cases in the above treatment. 
The arguments and assumptions must be modified slightly if one wishes to 
include spectral families such as Legendre polynomials. (One obtains the 
convergence P^<^ -♦ <t> in for <j» e and Assumption A must be 

modified accordingly. This, in turn, requires that u(t),u^(t) be in in 

order to carry out the convergence arguments above.) 

Remark 2.4. The presentation here assumes that one is performing the 
optimization in the least-squares fit-to-data over the admissible parameter set 
Q. In general, this is an infinite dimensional function space in the com- 
ponents q^ and f. Thus one requires, as explained in Section 1, a second 
approximation family and a double limit procedure. Recalling our 

discussions from Section 1, we note that in the problems considered in this 
section it suffices to use the set Q = C(n) x x x H^(n) in the com- 
pactness and approximation statements involving and Q. 
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Remark 2.5 . In the presentation above, we found that to guarantee the desired 
convergence, it suffices to have u(t) and Uj(t) in H^(0) and Uj^ e H°([0,T]xn) 
where u is the solution of (2.5), (2.6) corresponding to any limit parameters 
q in Q. Without further smoothness assumptions on the problem data one 
cannot readily guarantee this desired regularity for u. For example, if 
g L“(n), Ug g H^(0), Vg g H°(0) and f g H°(0,T) , as we have already observed 
one can use variational theory to guarantee existence of weak solutions with 
Ujj g H°(0,T; H"^). If, however, we assume q^ g H^, Ug g Vg g and f 
g , then semigroup arguments similar to those given in [15] can be made to 
obtain strong solutions with u(t) g u^(t) g H^, and u^^ g H°([0,T]xO). Thus, 
Qj in and sufficient smoothness on Ug, Vg, f will yield the desired 

smoothness for convergence. 

Remark 2.6 . The problem considered in this section was also investigated in 
[13] using a semigroup (Trotter-Kato approximation theorem) approach. Several 
differences in the results are noteworthy. First, we note that in the 

variational framework above, the approximating (H^ c: H^(n)) basis elements 
are not required to satisfy parameter dependent boundary conditions (contrary 
to the situation in [13]). Furthermore, the unknown source term f in the 
boundary condition (2.2) can be treated directly here (in [13], the treatment 

required a transformation to a system with homogeneous boundary conditions 
and nonhomogeneous equation) and this relaxes the smoothness and compactness 
assumptions needed on the f component of Q. (A similar relaxation can 

be obtained in the framework of [13] using a slightly different method for 

transforming the nonhomogeneity in the boundary condition to the system 
equation.) In this regard we also note that the approach in [13] requires 
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convergence qj in H^(n) as opposed to in C(n) (or x x L“(n) - 

see Remark 2.1). Thus the related compactness assumption on the Qj 

component Qj of Q is relaxed from to C compactness. This is 
potentially important in several respects. The characterization of compactness in 
C(n) is somewhat more natural (e.g. the Arzela-Ascoli lemma) than that in 
(e.g. see the embedding lemmas in [1]). Furthermore, in extending the ideas 
here or in [13] to treat estimation of discontinuous coefficients, compactness in 
(or a piecewise compactness) is more awkward and tedious to 

formulate than a concept of piecewise C or L“ compactness. Finally, we note 
that the mode of convergence required in the compactness of Q also dictates 
the type of approximating families one can use. We recall that the 

interpolation operators (see [13] [32]) for both piecewise linear and cubic 

spline approximations satisfy I^(q) -* q uniformly in q e Q in either C 
or whenever Q c However there are occasion where one might 

desire to use the weaker convergence requirement (e.g. when dealing with 
discontinuous coefficients and a piecewise C topology), in which the 
variational formulation of the above presentation can prove advantageous. 
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Large Flexible Structures 


In one important class of parameter estimation problems (see 
[5,20,24,29,35]), one wishes to estimate structural parameters (stiffness, damping, 
loading, etc.) in complex continuum models for elastic structures. The methods 
discussed in this paper can be successfully applied to such problems. To 
explain the approach, we consider a variable structure cantilevered damped 
beam with a tip body and base acceleration. Such a model might be used for 
example to describe the vibrations of shuttle attached payloads or large 

flexible spacecraft members. To be more specific we consider an 

Euler-Bernoulli beam of length t with viscoelastic damping (a Kelvin-Voight 
solid) and tip mass m (instead of a tip body). Then the equations for 
transverse (planar) vibrations in the presence of an axial force due to base 
acceleration are given by 







t > 0, 0 < X < 

r ^ 

r a^u 

a^u . 

+ a — ll 

■ ax 

1^12 a^2 + ^3 

dx^dt^ 

ax J L=j! 
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u(0,.) = (t> , Uj(0,-) = vj/ . 
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Here = p(x) is the linear mass density, Qj = EI(x) is the flexural 
rigidity or stiffness, Qg = Cjjl(x) is the viscoelastic damping coefficient, 

= m is the tip mass, f is a distributed lateral load, and g is a 
transversely applied force on the tip mass. The internal tension ct due to 
axial loading from base acceleration is assumed to be a known function of q^ 
= p 5 d 4 = ni , and ds = = the base acceleration. From observations of the 

beam (displacement, velocity or strain), one might desire to estimate the 
parameters q = (q^, q^, dg, d 4 , dg) = (p,EI,CjjI,m,ao) from a specified para- 
meter set Q. 

For such problems one can use a semigroup-Trotter-Kato approximation 
formulation to develop computational procedures - e.g. see [5,26,31]. However, 
some advantages are obtained in using a variational formulation (with "state" 
u(t) = (u(t,-), u(t,i)) ) similar to that in (2.5), (2.6) or (2.7). This is done in 
[6] and [15], [16] where detailed arguments for convergence are given. We shall 
not discuss them further here, except to note that the variational approach 
allows for a much weaker compactness criterion on the admissible parameter 
set Q. For example one can hypothesize compactness in the C(0,jj) norm 
(or in the L“(0,i) norm in a sense similar to that mentioned in Remark 2.1 
above) with respect to the components representing El and c^I. (Compare this 
with the or H^weak compactness assumptions in [5] and [26].) 
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IV. Bioturbation in Abyssal Sediments 


In the previous two sections, we introduced problems (seismic and flexible 
structures) for which one can investigate parameter estimation techniques using 
either a semigroup formulation or a variational formulation. In this section 
and the next, we mention briefly two other classes of problems which are not 
readily treated with a semigroup formulation yet which can easily be analyzed 
in a variational setting. We first turn to problems related to the estimation of 
the effects of biological mixing in abyssal sediments. 

Sediment formation in lakes and deep seas is of great importance to 
geophysical scientists who use core samples of this sediment in their 

investigations of the history of the earth. Unfortunately, the historical records 
contained in these core samples are often perturbed by a phenomenon called 
bioturbation [36] which is the mixing of sediments due to the activities of 

organisms near (on the order of 20-40 cm.) the sediment-water interface. 
These activities consist primarily of burrowing (e.g., for safety) and 

ingestion-excretion and are not easily described quantitatively. 

An important goal of some geologists is to understand (quantitatively) 
bioturbation well enough so as to enable one to remove its effects and 

properly interpret the data in core samples, thereby sharpening the details in 
these geologic records. A number of increasingly sophisticated mathematical 
models have been proposed and a brief review of a number of these is given 
in [23]. One model of interest is the one proposed by Guinasso and Schink in 
[23]. 
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Briefly, the model involves one-dimensional (depth) transport equations 
for a moving chamber (assumed uniform in horizontal directions) in which 
mixing and advective flow of material takes place. Depth in the chamber is 
represented by coordinates x, 0 $ x ^ L, and the chamber (and hence co- 
ordinate system) is assumed to be moving upward with a velocity Qj = Q 2 (t) 
[corresponding to sedimentation rate or build-up] so that it is always located in 
the top L cm. of the sediment, i.e., x = 0 is always at the 

water-sediment interface. The bottom of the chamber x = L is located at 
that depth beyond which (it is assumed) no further changes (i.e. no 
bioturbation) in the historical records occur. If u = u(t,x) is the 

concentration of material (e.g., shards of ash, tracer, etc.) with whose movement 
one is concerned, a model based on mass balance in the chamber, Fickian flux 
for the bioturbation, and appropriate boundary flux considerations is given by 


(4.1) 


9u 

at 


au 


u uu -s QXX 

— f'4i(x) — } - Q2(0 — 

fly ^ ax J ^ a-x 


ax 


ax 


ax 


t>0, 0<x<L, 


(4.2) 


au 

-qi(0) — (t,o) + Q2(t)u(t,o) = 0 

^ ax 


(4.3) 


au 

-q^(L) — (t,L) = 0 . 
ax 


Here is a depth dependent "bioturbation" coefficient. 


To understand the effects of bioturbation on the distribution of 


material concentrations in core samples, it is sufficient then to know the 
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parameters q = (q^.qj.Qg) , qg = L and, of course, know that use of these 
parameters in the model gives one an accurate quantitative description of 
concentrations found in core samples. Given observations from core samples, 
this leads to a parameter estimation problem involving estimation of the 
functions q^ and qj and the chamber length qg. In [17], it is shown how 
to formulate and treat such problems in a discrete semigroup - Trotter-Kato 
approximation framework if one assumes that q^ is chosen from a class of 
functions with finite dimensional parametric representation and is 

independent of time. If one wishes to treat more general problems of 
estimating x -♦ q^(x) and t -• qgCO in general classes of functions, these are 
not so easily investigated using a semigroup setting (note that (4.2) involves 
time dependent unknowns). However, a variational formulation not unlike that 
given in [10] provides an amenable framework in which convergence 
arguments can be given under rather weak compactness assumptions on the 
admissible parameter sets. These arguments, to be given elsewhere, are similar 
in spirit to those given in [10] for transport problems involving estimation of 
spatially and temporally dependent coefficients. 
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V. Nonlinear Population Dispersal 

In this section we turn to a brief discussion of estimation and a 
variational formulation for problems that are typical of population dispersal 
problems with transport coefficients (such as "diffusion" coefficients) that are 
density dependent. Nonlinearities of the general type we consider here are 
also important in porous media estimation problems. 

Among the fundamental mechanisms often of interest to investigators in 
population dispersal (see [9,11,14,30,33,34]) are (in addition to the usual 
emigration-immigration, birth-death mechanisms): a dispersive mechanism 

associated with random movement or foraging; an attractive or repulsive force 
which induces directed movement of population members toward favorable or 
away from unfavorable environmental surroundings; and a mechanism 
representing population pressure due to interference between individuals in the 
population. In mathematical models for transport including such mechanisms, 
it is density dependent higher order terms that present difficulties in 
theoretical (and computational) considerations. To illustrate how a variational 
framework may be used for such problems, we shall sketch fundamental 
convergence arguments for problems involving estimation of the parameter 
function q in simple models of the form 

au a du V 

(5.1) TT = 7" [ q(t.x,u)— ] t > 0, X £ n = (0,1), 

at ax ax 


(5.2) 


u(t,0) = u(t,l) = 0 
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u(0,.) = U(j. 

It’s not difficult to extend the ideas to many of the more detailed models 
(which include other desirable, but more easily treated mathematically, 
transport terms) studied in [9,14,30,33,34], While we shall formulate the 

problem here in terms of estimating rather general "diffusion" coefficients q 
in (5.1) from a rather broadly defined class Q , we actually have as basic 
motivation the treatment of coefficients that are bounded below by 
density-independent base values, that are saturation limited with rates that are 
affine as a funetion of density in the range between the base-value and 
saturation thresholds. To be precise, our development has been motivated by 
problems where ^ -* q(t,x,0 is continuous and has the form (see [14]) 


m(t,x) 5 ^ ^(t,x) 


(5.4) 


q(t,x,0 = 


a(t,x) + p(t,x)i Co(t,x) « 5 « 5i(t,x) 


M(t,x) 5i(t,x) « i . 


In such problems we seek to estimate ot,P,5o,5p (which determines q in 
(5.4) if the continuity assumption is invoked) from sets A,B,rQ,fj respectively. 
We shall sketch our ideas in terms of rather general conditions on the 
parameter set Q , noting that under appropriate assumptions on the sets 
A,B,rQ,rj , the above example is included as a special case. 
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We first rewrite (5.1) - (5.3) in variational form, which consists of 
finding u(t) e Hj(n) satisfying 

(5.5) + <q(t, -, u)Du,D 4 i> = 0 
for all v|; e Hj(n) along with initial conditions 

(5.6) u(0) = Uq . 

To define an approximate state system, we assume we have chosen a family of 
finite-dimensional state spaces c: hJ( 0) with orthogonal projections : 

Hj(ft) - rN in the H°(n) inner product. We also assume Assumption A of 
Section 2 holds with replaced by H^. The approximate systems are then 

given by seeking u^(t) e satisfying 

(5.7) <u^,vjj> + <q^(t, •,u^)Du^,Dvj;> = 0 

for all e and 

(5.8) uN(0) = P^Uq . 

The parameters q^ are to be chosen from some admissible parameter set Q. 

In the usual manner (see Sections 1,2), for a convergence and stability analysis 
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one desires to argue that -» q in Q implies u^(q^) -► u(^ for arbitrary 
sequences {q'^} in Q. The mode of convergence in Q is, of course, also the 
sense in which we wish to define "compactness" of Q. In this 

particular example, we assume Q "compact" in the following sense: 

Assumption C : Any sequence in Q has a convergent subsequence {q’') 

with limit q in Q in the sense 

(5.9) |q''(t,-,v) - q(t,.,v)|^- 0 in H°(0,T) 

for every v € L“(n). 

In the case where Q consists of functions of the form given in (5.4), it is 
straightforward to translate the compactness criteria of Assumption C into 
easily verifiable compactness criteria on the parameter sets A,B,rQ,rj. 

Further assumptions (again motivated by and easily verified for sets 
containing functions of the form (5.4)) on Q are necessary for convergence 
arguments and we therefore assume the following: 

Assumption D: There is a constant v > 0 such that <q(t, •,v)D>j^,D 4 y> "i- 

vl 4 <|^ for every q e Q, v,«|, e Ho(fl). 

Assumption E: There is a constant Mq > 0 such that |q(t,-,v). 4 >|Q « 

Mq|v|<|q for all V £ L“(n), e H°(n), q e Q. 
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Assumption F: There exists B e L“((0,T) x (1) such that |q(t,x,5) - q(t,x,n)| 

$ B(t,x)|t - n| for all \,x\ e and all q e Q. 

We note that Assumption F implies existence of a constant K such that 
|q(t,-,v) - q(t,.,u)|o ^ K|v - u|o for all v,u e H°(n). 

We are now in a position to outline convergence arguments, for which it 
suffices (in the usual manner) to consider z?^(t) = u^(t) - P^u(t) where u^, 
u , the solutions of (5.7), (5.8), (5.5), (5.6) corresponding to q^, q, respectively, 
where q^ -» q in the sense of (5.9) given in Assumption C and {q*^} is any 
such sequence in Q. 

From (5.5) - (5.8) we obtain ^(0) = 0 and 

(5.10) <zf ,+> = <q^(t,.,u)Du - q*^(t,-,u^)Du^, Dij;> + <(I - P^)Uj,vj^> 

for all 6 H^. Choosing in (5.10) and adopting the notation "^v), 

q^(v) for ^t,-,v), q^(t, .,v) throughout, we obtain 

7 — = <q(u)Du - q^(u^)Du^,Dz^> + <(I - P^)u,,z^>, 
dt ' ' 

or 

(5.11) 7 — |z^|^+ <q^(u^)Dz^,Dz^> = <q(u)Du - q^(u^)DP^u,Dz^> 

dt ' ' 


+ <(I - pN)Uj,zN>. 
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From Assumption D we immediately find 


( 5 . 12 ) 



+ v|z^P ^ <q(u)Du - q^(u^)DP^u,Dz^> + — |(I - P^)UtP 


+ 


1 

2 



2 


Considering the first term on the right side of this inequality we find (all 
norms are the norm unless otherwise indicated) 


(5.13) <q(u)Du - q^(u^)DP^u,Dz% = <(q(u) - q^(u))Du,Dz^> 

+ <(q^(u) - q^(u^))Du,Dz^> + <q^(u^)D(u - P^u), Dz^> 

< i |[q(u) - q%)]Du|' + j |dzN|' 

+ u * q%^)]DuP + 7 |dz^P 

+ ^|q^(u^)D(u - P^u)P + j|dz^| . 

The terms in the right side of this inequality can be estimated (we use 
Assumptions E and F) as follows: 

|q^(u^)D(u-P^u)| ^ Mq|d(u - P^u)| 
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|(q^(u) - q^(u^))Du| < |du|„k|u - | < |du|„K|^|(I - P^)u | + |z^|) 

|(q(u) - q^(u))Du| ^ |du| |q(u) - q'^(u)|„ 

Using these estimates and (5.13) in (5.12), we obtain 



where 


U = 




2 

CD 


and 


G^(t) = 7 |(I - P^)uJ^ + ^|D(u - pNu)|^ + ^Idu L|(i . pN)u|^ 

The remaining arguments are similar to those (integration, Gronwall, etc.) of 
Section 2. One easily obtains the convergence z'^(t) -► 0 in H°(n) for each 
t and, actually, considering again (5.14), the additional results -♦ 0 in 
H°(0,T; H^(0)). Of course, appropriate smoothness (e.g. Uj e H°([0,T] x fl) , Du 
6 H°(0,T; L“((1)) ) assumptions on the solution u must be invoked. 

We remark that the ideas sketched here readily extend to 

multi-dimensional domains n (see [10]) and that computational efforts based 
on the variational framework have in preliminary calculations (see [14]) been 
promising. 
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